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(57) Abstract 

An effective organization and the transferring data among the functional blocks of an integrated system (ECC-IC) of a read channel of 
data recorded on DVD-Rom, DVD-Ram, DVD-R or CD Rom for performing Reed-Solomon decoding including off-li ne he roic correction, 
or deinterleaving, Reed-Solomon decoding and correction, said integrated system including an input buffer (INPUT BUFFER), an interface 
with a microcontroller bus (/xP IF), a Reed-Solomon decoder (REED-SOLOMON DECODER), an embedded RAM (DRAM) accessed 
through a 17-bit bus, a descrambling and EDC control block (DESCR AMBLING & EDC) for D VD m odes of operation a descrambling 
block (DESCRAMBLING CD) for CD modes of operations, a data output interface (OUTPUT INTERFACE) and a t iming control block 
(TIMING CONTROL), permits the decoding of the input data acquired through said input buffer (INPUT BUFFER) at a rate of up to 
four-times the reference bit rate of DVD formatted data using a clock for accessing said embedded RAM having a frequency half that of 
the clock that is used in said Reed-Solomon decoder, while reducing the number of accesses to said embedded RAM needed to perform 
the decoding. ^ 
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REED-SOLOMON DECODING OF DATA READ FROM DVD OR CD 
SUPPORTS 

FIELD OF THE INVENTION 

The present invention relates to a read-channel circuitry of data read from a mass 
5 memory support such as a DVD or a CD and more in particular to an integrated 
system for decoding according to the Reed-Solomon algorithm data read from a 
mass memory support coded according to standard DVD-ROM, DVD-R, DVD- 
RAM or CD-ROM protocols. 

BACKGROUND 

10 DVD and CD optical supports are more and more used for storing large quantities 
of data in PC's, digital audio and video playback systems and the like. The storing 
and reading of data to and from these supports imply the coding and decoding of 
data according to standard protocols that are defined at international level (e.g. 
ISO/IEC, CEI/IEC, etc.). 

15 In write-read channel circuitry, reliability in terms of ability of detecting and 
correcting errors, especially during a phase of decoding of the coded data read 
from the support during a reading phase, and speed are of paramount importance. 
Obvious cost-effectiveness considerations call for the highest level of integration 
of system and/or subsystem circuitries in the minimum number of distinct 

20 integrated circuits. Multifunctional Reed-Solomon decoders, capable of handling 
either DVD decoding and correction or CIRC decoding and correction for all the 
commonly used CD-modes should be advantageously integrated in a single device 
including an embedded RAM required for the decoding and correction operations 
on a bitstream of input data as produced by the data aquisition means of the read 

25 channel. 
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An architectural layout of such a multifunctional decoder ECC-IC is depicted in 
Fig. 1. 

The integrated decoder handles CD modes bitstreams of any format as well as 
DVD-ROM, DVD-RAM and DVD-R mode bitstreams and advantageously should 
5 possess speed capabilities of handling bitstreams equivalent to a significantly 
large multiple of standard or base CD rates and of standard or base DVD rates. 

With reference to the functional diagram of Fig. 1, the multifunctional integrated 
decoder ECC-IC, when operating in DVD mode, performs horizontal and vertical 
decoding of the input data stream and the decoded data are then descrambled and 
10 EDC checked before sending them to an output interface circuitry. When 
operating in a CD mode, the data are decoded and deinterleaved without 
performing any C3 decoding. Finally, when functioning in a BCA mode, the 
integrated decoder may perform a Burst Cutting Area decoding of the data stream. 

The input data stream consists of the_ signals output by a data aquisition IC as 
15 depicted in Fig. 2. The signals contain data, information about the data and 
address information. 

byte_clk [1] 

The byte_clk (byte clock) signal indicates that the data byte can be read. It is 
generated once per data byte for 1 system clock cycle. 

20 erasure [1] 

The erasure bit is set to 1 if the current data byte is not a valid 16/8 modulation 
pattern (14/8 for CD) — if the pattern is valid, erasure is set to 0. 

data [8] 

The 8-bit data bus contains the demodulated data byte. 
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SID [4] 

The 4-bit SID (sector ID) contains the 4 least significant bits of the logical 
sector ID. This signal provides the sector address within each block. 

id_error [1] 

5 An id_error bit of 0 indicates that the SID was decoded with no errors and no 
corrections. If the SID contained errors (no correction possible) or if a single 
error was corrected, the id_error bit is set to 1. 

DVD/BCA frame_address[4:0] or CD SO/SI 

In DVD modes the acquisition part keeps a memory of the syncs received and 
1 0 from this history extract the 5-bit frame address. 

In BCA mode frame_address[3:0] depends from the sync found (SB BCA , 
RSbcaI, RS BCA n, RS BC a13, RSbcaK RS B ca15). 

In CD mode the SO and SI signals are sent on frame_address[0] and 
frame_address[l] respectively. 

15 CD nxfr [1] or DVD next_frame[lj 

The DVD next_frame indicates that a new DVD frame is starting. The CD nxfr 
signal indicates that a new CD frame is starting. The BCA next_frame 
indicates that a BCA Re-sync has been found. 

The timing diagrams of the input data aquisition for the case of operation in DVD 
20 mode and in CD mode are shown in figures 3 and 4, respectively. 

ECC-IC has two kinds of output interface: one for CD-modes (serial) and one for 
DVD-modes (parallel). In particular, the CD output interface may be a common I 



WO 99/48097 



4 



PCT/IT98/00056 



S interface employing a format as depicted in Fig. 5, and the subcode interface 
has as format as depicted in Fig. 6. 

The Reed-Solomon decoder block depicted in Fig. 7, supports five main modes: 

1) DVD Outer code (208, 192, 17) 8 errors or 16 erasures 

5 2) DVD Inner code (182, 172, 1 1) 5 errors or 10 erasures 

3) CD CI code (32,28,5) 2 errors 

4) CD C2 code (28,24,5) 2 errors or 4 erasures 

5) BCA code (52,48,5) 2 errors or 4 erasures 

The number of erasures that can be corrected is programmable, depending on the 
10 mode, from 13 up to 16 for DVD Outer, from 7 up to 10 for DVD Inner, from 1 
up to 4 for CD C2. The Reed-Solomon block can be programmed to make a 
severe check "for "miscorrections": This preselection will cause a reduction of the 
decoding performance. 

THE ERROR CORRECTION ALGORITHM 

15 The complete algebraic decoding algorithm for the errors and the erasures is 
summarized in the following steps: 

STEP 1 . Calculation of the syndrome S(z), the erasure locator polynomial E(z) 
and the calculation of the modified syndrome T(z). If r(x) is the received code 
word 

20 S^IX-rfX* 

i = 0 

If a* is the position of a k-erasure and e is the number of erasures 
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£(*)=n< 1 -«a*) 
k = 1 

If t is the maximum number of errors the code is able to correct 

7(z) = S(z)E(z) mod(2 2t ) 

STEP 2. Perform the extended Euclidean Algorithm (modified version) to 
5 calculate the error locator polynomial a(z) and the error evaluator polynomial 
cd (z). Calculation of the new error locator polynomial ¥(z) 

¥<r)=o(z)£(z) 

STEP 3. Perform the Chien search to find the roots of the new error locator 
polynomial. The roots of this polynomial indicate the error and erasure positions 
10 in a received code word. Perform the Forney's algorithm to find the error and 
erasure values. 

STEP 4. Check the decoding process and correct the received code word. 

The timing control block sets the control inputs of the Reed-Solomon block and 
send start syndrome pulse (start_synd) with the first symbol of the code word. 

1 5 The enjsynd is acting as an enable for the data bus (data__in). 

Every erasure should be flagged by erasure __pos. During the shifting of the code 
word, the Reed-Solomon calculates its syndrome and its erasure polynomial. 

Once the code word is completely shifted into the Reed-Solomon, the controller 
has to start the Key Equation Solver (start Jces). 
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The Reed-Solomon responds when error and location values are ready for the 
controller (kesready). 

The errors and error locations are stored in a Lifo and the controller can read them 
with read j)os and readerror signals. 

5 The Reed Solomon processing consists of three main tasks: 

a) syndrome and erasure polynomial calculation (invoked by start synd) 

b) key equation solving and error calculation (invoked by start Jces) 

c) Chien & Forney (generating the ending signal kes_ready) 
as depicted in the block diagram of Fig. 8. 

10 In DVD Outer decoding, the erasure polynomial is equal for each column, 
because it is calculated using the incorrectable flag coming from the Inner 
decoding. For this reason it is calculated only once for each Ecc block of data at 
the beginning of the vertical decoding process; the resulting polynomial is stored 
depending on the store_eras _poly signal. This is depicted by way of a block 

15 diagram in Fig. 9. 

During the DVD decoding process (Inner - Outer - Inner - Outer - .. ) the signal 
seljeras j>oly send to Key Equation Solver the current erasure polynomial (Inner) 
or the previous stored erasure polynomial (Outer). 

SCOPE AND SUMMARY OF THE INVENTION 

20 To overcome the intrinsic low speed of embedded DRAMs it has been found that 
a considerable increase of the required processing speed capabilities in a fully 
integrated decoder (ECC-IC) may be obtained by organizing the data flow within 
the integrated decoder and the embedded RAM in a way as to reduce the 
number of accesses to the embedded RAM needed to perfom the decoding at the 
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required speed while using the Reed-Solomon decoding block at twice the 
maximum clock frequency allowed by the embedded RAM (25MHz). 

Accordingly, in DVD decoding, the syndrome engine is made capable to process 
two code words at the same time, by storing the final polynomials into distinct 
registers: two for storing vertical syndromes and one for storing horizontal 
syndromes, thus implementing a kind of parallel processing in the decoding 
phase. 

Accordingly, in CD decoding, a new addressing scheme has been found to 
perform very high speed M2 deinterleaving, minimizing the number of accesses 
to the embedded DRAM required to perform the deinterleaving of data. 

The embedded DRAM is organized in distinct banks, each divided into a number 
of pages of certain capacity of words, functioning in a synchronous page mode. 

The method and architecture of the multifunctional decoder of the invention are 
defined in the appended claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is an architectural layout of the multifunctional decoder ECC-IC of the 
invention; 

Fig. 2 shows the input data coming from a data acquisition IC; 

Fig. 3 and 4 show the timing diagram of the input data acquisition for the case of 
20 operation in DVD mode and in CD mode, respectively; 

Fig. 5 and 6 illustrate the output serial I 2 S data interface for CD mode and the 
output serial subcode interface for CD mode, respectively; 

Fig. 7 represents the Reed-Solomon Decoder block; 
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Fig. 8 schematically shows the processing performed by the Reed-Solomon 
Decoder; 

Fig. 9 schematically illustrates the processing of erasure flags during horizontal 
and vertical DVD decoding; 

5 Fig. 10 shows the parallel processing of partial syndrome values; 

Fig 11 shows the timing diagram of the embedded DRAM working in a 
synchronous page mode, for a word access in first cycle; 

Fig. 12 shows the functional blocks of the ECC-IC device that are involved with 
the data flow in a DVD mode of operation; 

10 Fig. 13 illustrates schematically the different phases of processing (DVD mode); 

Fig. 14 shows the particular mapping adopted for the embedded DRAM for one 
ECC-block of data; 

Fig. 15 illustrates the output processing; 

Fig 16 illustrates the processing referred to a complete elaboration time frame, 
15 divided in four steps; 

Fig. 17 shows the functional blocks of the ECC-IC device that are involved with 
the data-flow for the case of a CD mode of operation; 

Fig. 18 illustrates the first step of the deinterleaving (Ml-deinterleaving) of input 
data; 

20 Fig. 19 illustrates the particular mapping inside a page of the embedded DRAM 
(M2-deinterleaving); 

Fig. 20 shows deinterleaved frames read from the DRAM; 
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Table 1 illustrates the particular mapping of the two banks of DRAM (M2- 
deinterleaving) refering to the input frame numbers. 

Fig. 21 depicts the enable signal according to the different frame numbers; 

Fig. 22 shows the particular mapping inside a page of the embedded DRAM (M3- 
5 deinterleaving); 

Fig. 23 illustrates schematically the different phases of processing (CD mode); 

Fig. 24 illustrates how one block of 12 words is read from memory for each input 
frame; 

Fig. 25 shows the functional blocks of the ECC-IC device that are involved with 
10 the data flow during operation in BCA mode; 

Fig. 26 illustrates the mapping of the embedded DRAM (BCA mode); 

Fig. 27 illustrates schematically the different phases of processing (BCA mode) 

Fig. 28 depicts the descrambling architecture for DVD mode of operation; 

Fig. 29 depicts the descrambling architecture for CD mode of operation; 

15 Fig. 30 shows the functional block diagram of the EDC checker; 

Fig. 3 1 depicts the way the new CRC value is produced by shifting the old value 
and xoring the result with the updating value. 
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DESCRIPTION OF AN EMBODIMENT OF THE INVENTION 

An integrated multifunctional Reed-Solomon decoder according to the general 
characteristics already described above and embodying the features of the 
invention, implementing the method of the invention of organising and 
5 transferring data among the functional blocks of the integrated decoder will now 
be described with reference to the drawings. 

With reference to the diagram of Fig. 10, the synd.2 value (rs_reg[2]), if set, 
selects for each byte_clk a different syndrome register (SKI, SK2) to store the 
polynomials related to the two different columns. The sel_synd j)oly signal 
10 chooses which one of the stored syndrome values is to be sent to Key Equation 
Solver (KES). 

TIMING CONTROL BLOCK 

According to the sample embodiment taken into consideration, the following 
assumptions were made: 

15 • The system clock speed is 5 0MHz. 

• The symbols coming from the data acquisition block are stored into an input 
memory working with the system 50MHz clock. 

• The embedded DRAM works in a synchronous page mode with a 25MHz 
clock. 

20 • DVDx4 and CDx32 are taken as target speed capabilities for the relative modes 
(two steps of decoding for DVD modes and complete CIRC decoding for CD 
modes). 

• The byte clock coming from the data acquisition device is always available. 
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• At the output of ECC-IC only complete Ecc blocks of data will be output. 

The characteristics of the DRAM embedded in the ECC-IC chip were the 
following: 

• 1 -Transistor Planar memory cell. 

5 • Planar Memory cell area: 1 1.2 mm2. 

• Organized in three banks of 16K words of 17 bits, split into 512 rows of 32 
words of 17 bits. 

• Byte- Write capability available, i.e. write operation can be applied to any of 
the two bytes. Read operation is always done on 17 bits. 

10 • 512-cycle refresh distributed across 1ms. 

• No Redundancy 

• Latched Output Data. 

• Bidirectional data bus. 

• On-chip voltage generators 

15 • Separate I/O data buses with write-through enable pin. 

During page modes, row addresses are latched for the duration of whole row 
access, i.e until precharge is performed, while column addresses are latched only 
during the actual column access. 

Row is accessed first, then words in that row (or page) at a rate of one word per 
20 cycle. An extra cycle is required to go back to precharge before accessing a new 
row. 
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There is no specific Refresh cycle. Any Read or Write operation is also a refresh 
cycle. 

The timing diagram of the embedded DRAM working in a synchronous page 
mode, for a word access in first cycle, is shown in Fig. 11. 

5 The Timing Control block is responsible of scheduling data transfer among the 
following functional blocks that compose the ECC-IC of Fig. 1 : 

• Input interface 

• Input memory 

• Reed Solomon decoder 

1 0 • DRAM (embedded in the IC) 

• Descrambling and EDC checker (for DVD mode of operation only) 

• Descrambling (for CD modes of operation only) 

• Output interface 

DVD MODE OF OPERA TION 
15 Processing of data starts each time a couple of rows has been received. 
The decoder ECC-IC performs the following operations: 

• horizontal correction of 2 rows, Ecc-block(#n). 

• vertical correction of 2 columns, Ecc-block(#n-l) (this operation starts only 
when the horizontal correction of this Ecc-block has been completed). 

20 • output of 2 rows, Ecc-block(#n-l) or Ecc-block(#n-2) (this operation starts 
when the vertical correction of the Ecc-block has been completed). 
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In order to support a sequential reading, the ECC-IC must complete the above 
processing before a new couple of rows is received. 

The minimum time frame when operating in a DVDx4 mode is 56.875usec 
(2xl82/6.4MHz). 

5 Fig. 12 shows the functional blocks of the ECC-IC device that are involved by the 
data flow in a DVD mode of operation. 

Normal mode 

Data and erasure flag coming from the data acquisition IC are sampled into 
Input interface and stored frame after frame into one of the two Input RAMs. 

10 Each time 4 frames (equivalent to two rows) have been received, the input is 
switched on the second RAM; the previous rows are read from this memory and 
sent both to the Reed-Solomon decoding block, to calculate Inner syndromes and 
corresponding erasure polynomial, and to the embedded DRAM, where two 
complete ECC blocks of data are stored. 

1 5 ECC-IC performs at the same time the Inner decoding of Ecc block #n and the 
Vertical decoding of Ecc block #(n-l). 

. Flags coming from Inner decoding are stored into DRAM to be used later in 
Outer decoding: for Row n , the erasure flag is stored into the (Row n , Col 182 ) word. 
Horizontal corrections are effected (up to 5 errors or 10 erasures) within the 
20 DRAM. 

While storing rows into one Ecc block in the DRAM, columns of the previous 
Ecc block are being decoded. Each time a word is read from DRAM, bytes of two 
different columns are retrieved, therefore the syndrome engine is able to calculate 
syndromes of both columns at the same time. 
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Vertical corrections are effected (up to 8 errors or 16 erasures) within the DRAM. 

When an Ecc block is completely decoded vertically, the output processing 
begins: data are read by rows from DRAM and sent to the descrambler block, and 
successively to the EDC checking block and finally to the output interface block. 

5 Each time EDC fails, a bit is set into a register (16 bits), containing the results of 
each sector. At the first fail an interrupt is generated towards the piP. 

Cycles are inserted to guarantee complete refreshing of the embedded DRAM. 

The different phases of processing are schematized in Fig, 13. 

Each Elab frame starts only when two complete rows have been received and are 
10 ready to be horizontally decoded and stored into DRAM. For this reason the byte 
clock should never disappear, in order to avoid arresting the internal processing. 

In the time frame two new rows are received, two rows and two columns are 
decoded. Therefore, the Elab time frame must be" shorter than twice the shortest - 
Input data time frame (DVDx4) = 2*(28.43 usee). 

15 In view of this length of each Elab frame, the decoding core will be always 
waiting for the rows_ready signal. If for any reason (as for example an unlocked 
DPLL) the Input data frame will be shorter than Elab frame the latest stored row 
will be overwritten in the Input RAM by data coming from the data acquisition 
IC. 

20 With a DRAM functionning in synchronous page mode, each time a new page has 
to be accessed, one extra cycle is needed to perform the pre-charging of the 
page.In one page of DRAM,, 16 bytes per row and 12 bytes per column are 
stored: lrow = 12 pages, 1 column = 18 pages. To store 2 Ecc blocks 432 pages 
are necessary. 32 rows are refreshed during each Elab frame. 
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At the end of each Elab frame, the core waits for the next two rows to decode, 
while refreshing the DRAM. As soon as a new couple of rows is received ready to 
be decoded, the process restarts. 

5 Heroic Correction 

If for any reason the output data are still affected by errors, the jjP may decide to 
initiate a more effective decoding of a specific Ecc block of data. 

The \xY can decide how many iterations of decoding should be done (Inner - 

Outer - Inner - Outer - - Inner - Outer); the maximum number of ECC-blocks 

10 is written in a ECC_block register, the number of iteration is written into a nb_iter 
register and the process is started setting another bit in the same register (HC). 

Data coming from the data acquisition IC are horizontally and vertically corrected 
as in normal mode. The flags generated from the vertical decoding are stored into 
DRAM to be used as erasure flags in the horizontal decoding step of the selected 
15 Ecc-block. 

At the end of the vertical decoding the ECC-block number is compared with the 
value written in the Ecc_block register. Ifthe ECC-block number matches, the 
input is disabled and the next step of decoding takes place on this ECC-block: 
for Col n the erasure flag is stored into the (Row 208 , Col n ) word. 

20 Data are read towards the DESCRAMBLING, EDC AND OUTPUT 
INTERFACE blocks and, at the same time, horizontal syndromes are calculated 
and corrections applied within the DRAM. 

At the end of the Ecc block, if any of EDC check bit is set and if the maximum 
number of iterations is not reached the process go ahead with another vertical 
25 decoding step and so on. 
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When all the EDC checks are right or when the maximum number of iterations 
has been reached, an interrupt is generated and the processing is stopped. 

The algorithm may be expressed as follows: 

STEP1: read Ecc blocks from disk. Normal decoding of data with check, after 
5 vertical step, of the Ecc-block number. No output of data during this step. This 
phase is repeated until the right Ecc-block number will be found. When this 
happens, the input of following data will be disabled. 

STEP2: Output data + EDC check + Inner correction (use flag coming from 
outer decoding as erasure flag). 

10 if (EDC is OK) OR (maximum number of iterations is reached) 

STOP 

else 

vertical correction 
repeat this step 
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Long Mode 

If a specific bit is set into a dedicated register (longm), the ECC-IC may work in 
long mode: the decoding process is still active but no data correction is effected 
and all bytes received from acquisition (user bytes + parity bytes) are sent to the 
5 output interface. 

Input RAM 

Two static Ram, each one 2 rows wide (2xl82x9bits), are employed. 

While data are stored into one RAM frame after frame, the previous two rows are 
read from the other RAM. Only when two rows are completely built, the transfer 
10 to Reed-Solomon decoder and to the embedded DRAM begins. 

For each location 1 byte and its erasure flag coming from the demodulation 
process are stored. 

Embedded DRAM 

3 banks of 16Kxl7 bits of DRAM memory are employed. Each bank is split into 
15 512 rows of 32 words of 1 7 bits. 

In DVD mode the three banks are addressed as a single memory; each ECC block 
is stored according to the following mapping: in each page (3x32xl7bits) 16 bytes 
of one row and 12 bytes of one column are stored. 

To store or to retrieve a complete row, 12 pages must be addressed, to read a 
20 complete column, 1 8 pages must be addressed. 

In this mode, the 17th bit of each location becomes meaningless. 

An extra-row (Row 2 o 8 ) and an extra-column (Col 182 ) are needed to store erasure 
flags coming from Inner and Outer decoding. 
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According to an important aspect of this invention, an effective mapping becomes 
intrumental in reducing as much as possible the number of accesses to the 
DRAM. 

By operating in a synchronous page mode, a cycle is lost at each change of page, 
5 because a precharge cycle is normally needed. 

By contrast, if when changing page the bank of DRAM is also changed, no extra 
cycle is needed. In practice, in writing a row, the page and also the DRAM bank 
are changed: every 16 bytes (bankO - bankl - bank2 - bankO - ... and so on). 

Therefore, in retrieving a column from the embedded DRAM during the 
10 decoding, it is always true that a page change occurs only when changing bank 
(from bank2 to bankO), thus there are no extra cycles. 

Extra cycles are needed only during horizontal and vertical correction, because in 
those operations, it may happen to be changing page while remaining in the same 
bank of the embedded DRAM. 

15 Of course, also during refresh there occurs an extra cycle, because all the banks 
are refreshed at the same time (dummy read). 

The particular mapping of the embedded DRAM for one Ecc_block of data is 
depicted in Fig. 14. 

To store one Ecc_block 216 pages are needed. Refresh is effected only for 432 
20 pages of the DRAM because two Ecc_blocks of data are needed for the 
processing. 



WO 99/48097 



19 



PCT/IT98/00056 



Output interface - DVD modes 

Decoded sectors are read from DRAM row after row, descrambled and EDC 
checked if needed. The pages are read from memory (max 16 bytes, min 6 bytes) 
and stored into the output buffer (24 bytes). 

5 When SREQI is asserted by an ATAPI-IC, data available in this buffer are read 
and sent to the output. 

During the processing of input data the Timing Control samples the status bit of 
the output buffer to be ready to store another page of decoded data read from the 
DRAM as soon as enough data are sent to the ATAPI-IC 

10 Until the Elab frame is finished, 2 rows as maximum can be trasmitted to the 
output buffer. Instead at the end of internal processing and during refresh cycles, 
as many data as required by the ATAPI-IC are sent to the output buffer. For this 
reason decoded ECC blocks are sent to the ATAPI-IC always at the maximum 
speed allowed by SREQI signal. 

1 5 The output process is depicted in Fig. 15. 

Parallel Vertical Syndromes Computation 

During vertical decoding, data are retrieved from DRAM word after word; in 
each word a byte belonging to one column and another belonging to the next one 
are identified. The Reed-Solomon decoder decodes at the same time data of the 
20 input rows and data of these columns. 

The processing is depicted in Fig. 16. The Elab frame (computation time frame) 
may be divided in four steps: 

STEP1. 

- Horizontal syndrome computation (row#m, Eccb#n) 
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- Horizontal erasure polynomial computation (row#m, Eccb#n) 

- Vertical syndrome decoding (col#p, Eccb#n-1) 
STEP2. 

- Horizontal syndrome computation (row#m+l, Eccb#n) 

5 - Horizontal erasure polynomial computation (row#m+l, Eccb#n) 

- Horizontal syndrome decoding (row#m, Eccb#n) 
STEP3, 

- Vertical syndromes computation (col#p+2,p+3, Eccb#n-1) (to be continued) 

- Vertical syndrome decoding (col#p+l, Eccb#n-1) 
10 STEP4. 

- Vertical syndromes computation (col#p+2,p+3, Eccb#n-1) 

- Horizontal syndrome decoding (row#m+l, Eccb#n) 

During step3 and step4, the first column of the Eccb#n-1 to be read from DRAM 
is Col 182 , containing the erasure flags from horizontal decoding from which the 
15 vertical erasure polynomial is updated and stored too. 
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CD MODES OF OPERATION 

The functional circuit blocks of the ECC-IC involved into CD data flow are 
shown in Fig. 17. 

In CD-modes the 3 banks of the embedded DRAM are used as three different 
5 memories: M2-deinterleaving is carried out in two of the banks, while M3- 
deinterleaving is performed in the third bank. 

Data received from the data acquisition IC is stored into Input RAM (user data) or 
sent to output interface (subcode data and sync). For each F3 frame 32 bytes of 
user data are stored, each one with its erasure flag. 

10 The first step of the deinterleaving (Ml-deinteleaving) takes place into the Input 
RAM according to the sheme of Fig. 18. 

Data are read from this memory deinterleaved, sent to Reed-Solomon decoder and 
stored into the correction FIFO. In this step the Reed-Solomon core calculates the 
CI syndrome of the input frame. 

15 Meanwhile, the previous frame stored into the FIFO is corrected on the fly and 
stored into the two banks (DRAM1, DRAM2) of the DRAM, organized in 512 
pages of 32*17 bits each, according to a mapping as schematized in Fig. 19: the 
28 bytes frames are stored in one DRAM page together with its CI decoding flag. 
In a page of one bank of DRAM two 28 bytes-frame can be stored. The M2- 

20 deinterleaving takes place in these two banks of DRAM. 

According to an important aspect of this invenction, an effective addressing of the 
incoming frames together with the mapping of Fig. 19 becomes instrumental in 
reducing as much as possible the number of accesses of the DRAM, allowing to 
perform a very high speed M2-deinterleaving. 
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1 12 frame need to be stored in the two banks, requiring 28 pages of DRAM1 and 
DRAM2: therefore only these pages need to be refreshed. 

Tab. 1 shows the content of the two banks (DRAM1, DRAM2) after 112 frames 
have been received. Depending on the circular approach chosen, frame#113 and 
frame #121 will be stored in page#0 of DRAM1 and so on. 

Functioning in a synchronous page mode, the embedded DRAM needs a cycle for 
precharge each time a new page is addressed. 

Frames coming from the correction FIFO are corrected on the fly and stored in 
DRAM1 or into DRAM2: 32 corrected bytes are read from the FIFO but only 28 
bytes are stored into DRAM, because the parity bytes can be discarded. 

The operations performed when a frame is written into the right DRAM bank are: 

- select the right bank and the right page 

- write 14 words (17 bits) into the memory 
To write a complete frame we need (1 + 14)cycles. 

Data for C2 syndrome calculation (28 bytes + flags) are read from M2 memory: 
the new mapping of the data allows to reduce the number of accesses of the 
DRAM needed to retrieve both deinterleaved data and correspondent flags. 

Fig. 20 shows that to retrieve a complete deinterleaved frame (1+2)* 8=24 DRAM 
cycles are needed. In fact, to read a deinterleaved frame, 8 pages are addressed 
and 32 bytes retrieved: only 28 bytes are significant. 

Firstly, the page where to start from must be decided, according to 
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where "frame" stands for the frame number. 

From each selected page 4 bytes in 2 cycles are read, two from DRAM1, two 
from DRAM2: for example, if page#0 is selected, in the first cycle a byte from 
5 frame#l (DRAM1) and one from frame #5 (DRAM2) are read, in the second 
cycle a byte from frame #3 (DRAM1) and one from frame #9 (DRAM2) are read. 

Every time 1 bytes from a page are retrieved, the page number is incremented 
according to 



10 At this point, from the 32 bytes read, the 28 significant bytes are chosen: they are 
always consecutive but the first significant byte change position each 4 frames, 
still remaining in the first selected page. 



The enable signal is shown in Fig. 21, according to the different frame numbers. 
15 The waveform of this signal repeats itself every 16 frames. 

M2-deinterleaved data are read from DRAM, sent to the Reed-Solomon decoder 
and stored into the correction Fifo again. In this step the Reed-Solomon core 
calculates the C2 syndrome. 

20 The previous frame stored into the Fifo is corrected "on the fly" and stored into 
the M3 memory bank. Deinterleaved data are read from the M3 bank and sent to 
the output interface. 



nextPAGE = {PAGE + 4) mod(28) 
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The third step of deinterleaving is done into the third bank (M3) of DRAM, 
organized in 512 pages of 32*17 bits each, by using only two pages for the 
process. 

To perform this third step of deinterleaving four frames coming from C2 
5 decoding are stored, each byte coming with its flag of correct/incorrect decoding. 

First of all, two bytes are packed in one word ready to be written into M3, 
therefore for each couple of bytes only 1 flag in 17th position is written. 

The mapping of the M3 DRAM memory is shown in Fig. 22. 

The operations performed when a frame is written into the M3 memory are: 

1 0 - select the right page 

- write 12 words (17 bits) into the memory 
To write a complete frame we need (1 + 12) cycles 

The operations performed when a frame is read from the M3 memory are: 

- select the right page 

15 - read 12 words (17 bits) from the memory, two words from frame (N) 

then two words from frame (N-2) 

To read a complete frame we need (1 + 12) cycles. 

The Elab_frame is illustrated in Fig. 23. 

The elaboration time frame must be shorter than the shortest Input time frame (for 
20 CDx32 rate it is of 4.25 usee). 

The Elab frame start only if there is a new F2-frame to be decoded. 
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The CI -syndromes update time depends from the speed of the disk, because the 
Input memory is shared between the input process and the CIRC decoding. 

The refresh of twenty eight pages is done at the same time on the three banks of 
the DRAM. 

5 At the end of an Elab frame refresh cycles are inserted until a new F2-frame is 
ready for decoding. 

Output of data in CD modes 

Data are read from the third bank of the DRAM (M3), frame after frame, and are 
sent to the CD-descrambler (if enabled) and to output interface. 

10 One blocks of 12 words is read from memory for each input frame as illustrated 
in Fig. 24. Each byte read from DRAM is sent to the output buffer. 

BCA MODE OPERA TION 

BCA is an area for recording information after finishing disc manufacturing 
process. The data in the BCA-Code consists of a BCA-preamble field, a BCA- 
15 Data field and a BCA-Postamble field. The length of BCA-Data field is (16n-4) 
bytes, with 1< n ^12. These bytes are encoded with a Reed-Solomon code 
(52,48,5) and protected by a 32 bits CRC. 

Fig. 25 shows the functional blocks of the ECC-IC device that are involved by the 
data flow in a BCA mode of operation. 

20 A SRAM (364x9bits) is used for storing the incoming data. The sync information 
is carried by frame_addr[4:0]. 

Data are stored into Input RAM to reconstruct the complete block. When the last 
sync (RS B cai5) ° r the postamble pattern is received, the decoding process starts. 
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Bytes are sent to the embedded DRAM and to the Reed-Solomon decoder to 
calculate the syndromes of the four code words. During this transfer, zeroes are 
inserted to fit the code word length (if required). All the bytes (information + 
parity bytes = 4*(48+4)bytes) are stored into DRAM, following the same 
5 addressing scheme as in DVD-modes: code word 0 is equivalent to RowO, code 
word 1 to Rowl, code word 2 to Row2, code word 3 to Row3. The mapping is 
illustrated in Fig. 26. Only 4 pages of DRAM need to be refreshed. 

Corrections are applied into the DRAM. 

Data are read towards the EDC checker to test the result of the decoding. The 
10 EDC flag is stored into an edc_register[0]. 

The end of processing is signalled to |iP by interrupt. The |iP will take care of 
transferring data to the AT API buffer. 

The Elab frame in BCA modes is illustrated in Fig. 27. 
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Acquisition and Timing Control - BCA heroic-mode 

If sync detection problems are experienced in the acquisition of data, the \xP may 
decide to do the demodulation of BCA by firmware. Setting the HC bit in |iP 
interface, data received from the data acquisition IC are simply stored into the 
5 Input RAM. When the last byte is received, the \xP begins to read all the blocks 
from this memory. After earring out demodulation and formatting of the data 
blocks, data are written back to Input RAM and the decoding process starts (as in 
normal BCA mode) when the Start bit in the |oP interface is set by the |oP. 

DVD DESCRAMBLER 

10 Data are descrambled just before reaching the EDC checker and Output interface. 
Bytes coming from the embedded DRAM after Reed-Solomon decoding are 
descrambled "on the fly" and then passed to the EDC checker. 

The basic descrambling operation is to forward one byte unchanged if it does not 
belong to the main data area, or to perform another XOR operation with the right 
15 scrambling value to retrieve the original data. The scramble value must be 
generate internally. 

Each already error-corrected and deinterleaved data sector is treated separately 
(during descrambling), sequentially byte after byte (as part of an entire Ecc_block 
of data). 

20 The overall architecture of the Descrambler for DVD mode is depicted in Fig. 28. 

The main input is the sequential stream of data bytes. The row and column 
indexes of the current input data byte are needed for algorithm control. From such 
information principally the Sector_start (first byte of a new data sector) and the 
Blockjstart (first byte of a new Ecc block) must be retrieved. 
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The Desclamber is provided with the 50MHz master clock and with a data_enable 
signal indicating that an input data has to be processed. 

The main output is a stream of descrambled bytes, accompanied by control 
signals necessary for the EDC checker and output interface. 

5 The Descrambler processes only DVD-data, depending on the format of the input 
data. In the other modes of operation, the data bytes are simply conveyed to the 
output. 

CD DESCRAMBLER 

Data (not audio) coming from CIRC decoding must be descrambled before going 
10 to C3 decoding. This operation may be done inside ECC-IC or in an ATAPI-IC. 
A bit written in a register (en_des) enables/disables this function. 

The incoming byte are xored with a scramble value generated by a FSR, as shown 
in the functional block diagram of Fig. 29r 

The 15-bit register is of the parallel-block synchronized type and fed back 
1 5 according to the polynomial x 15 + x + 1 . 

After the Sync of the sector (0x00, OxFF ten times, 0x00) has been found, the 
register is preset with the value 0x0001, where 1 is the least significant bit. A 
symbol counter will count 2340 bytes to be descrambled. 

Each time a Sync pattern is detected a preset is generated (the Sync window is 
20 always active) and the counter reset to 2340 (Short Sectors). 

Each time the counter reaches zero, a preset signal is generated anyway, even if 
no Sync pattern is found. The counter is reset to 2340 (Long Sector). 

Each time a new byte is read, the FSR value St[15:0] is updated as follows: 
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S t+ i[15]= S t [8]xorS t [7] 

S t+ i[H] = S t [7]xorS t [6] 

S t+1 [13] = S t [6]xorS t [5] 

S t+1 [12]= S t [5]xorS t [4] 
5 S t+1 [ll]= S t [4]xorS t [3] 

S t+ ,[10]= S^lxorS^J 

S t+1 [ 9] = S t [2]xorS t [l] 

S t+1 [ 8]= S t [l]xorS t [0] 

S t+ ,[7:0]= S t [15:8] 
10 EDC CHECKER 

« 

The EDC is a CRC check which , in DVD-mode, is applied to the sectors after the 
descrambling has been performed. In BCA mode the EDC check is effected to test 
the result of decoding. 

BCA Mode 

15 A decoded BCA block consists of 16xn bytes, with 1< n <12. The last four bytes 
are parity bytes, obtained dividing the information bits by the generator 
polynomial: 

g(x) = x 32 +x 31 +x 4 +l 

At the beginning of each block a start signal resets the EDC value to zero. With 
20 each enable signal, data is read from the embedded DRAM to the EDC. 

The current value of EDC is updated each time a new byte is received. 
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After 192 bytes the check is stopped and a result flag is set to zero if the check 
was satisfactory. Otherwise this flag is set to 1. The EDC doesn't change the data 
in any case. 

The result flag of the block is stored into a register edc_reg[0], and an interrupt is 
5 generated towards the piP to signal the end of BCA processing. 

The |xP will take care to read the decoded bytes from internal DRAM to an 
AT API buffer. 

DVD Mode 

Each sector consists of 12 rows* 172 bytes. The EDC checker gives a correct 
10 check if the error-correction has performed well. Of course, this check has no 
100% safety, but a high probability that incorrect data be recognized. 

At the beginning of each sector, the start signal resets the EDC value to zero. 
With each enable- signal data is shifted from the DVD Descrambler to the EDC 
checker. 

15 The current value of EDC is updated each time a new byte is received. 

After 2064 bytes the check is stopped and the result flag is set to zero if the check 
was satisfactory. Otherwise this flag is set to 1 . 

All the result flags of the sectors inside an Ecc_block of data are stored into a 
register edcjreg[15:0]; when the first EDC failure occurs, an interrupt is 
20 generated towards the jxP; the content of this register will be reset when the first 
byte of the next Ecc_block will be output. 

A functional block diagram of the EDC Checker is shown in Fig. 30. 

Four bytes are attached to 2060 bytes in a data sector. Let's suppose that the MSB 
of the first byte of the sector is b 1651 1 and the LSB of the last byte of EDC is bO. 
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The generator polynomial of the code is 



#(jc) = x 32 +x 31 +x 4 +1 



and the information polynomial is 



32 



/(*) = 2>«*' 



i = 16511 



10 



The implemented algorithm works directly on bytes. This allows to reduce the 
frequency of the clock needed in this processing. 

An internal ROM storing 8 words of 32 bits is used to update the CRC value each 
time a new byte is received. The stored values are obtained as remainder of the 
division 



calculated in GF(2). 

At the beginning of the sector the value of CRC is initialized to 0x0000. The input 
byte is xored with the most significant byte of the current CRC value: each "1" in 
the resulting byte enables one stored value; all the enabled bytes are xored to give 
an updating value. The new CRC value is obtained shifting the old one and xoring 
the result with the updating value. This processing is illustrated in the diagram of 
Fig. 31. 
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CLAIMS 

1. A method of organizing and transferring data among the functional blocks 
of an integrated system (ECC-IC) of a read channel of data recorded on DVD- 
Rom, DVD-Ram and DVD-R for performing Reed-Solomon decoding including 
5 off-line heroic correction, said integrated system including an input buffer (Input 
BUFFER), an interface with a microcontroller bus (pP IF), a Reed-Solomon 
decoder (REED-SOLOMON DECODER), an embedded RAM (DRAM) accessed 
through a 17-bit bus, a descrambling and EDC control block (DESCRAMBLING & 
EDC) for DVD modes of operation, a data output interface (OUTPUT INTERFACE) 

10 and a timing control block (Timing Control), characterized in that the decoding 
of the input data acquired through said input buffer (INPUT BUFFER) is performed 
at a rate of up to four-times the reference bit rate of DVD formatted data using a 
clock for accessing said embedded RAM having a frequency half that of the clock 
that is used in said Reed-Solomon decoder, while reducing the number of accesses 

15 to said embedded RAM needed to perform the decoding, the method comprising 
the steps of: 

a) organizing DVD mode input data in two blocks or rows of 182 bytes each; 

b) storing each input block in 12 pages of distinct banks in which said embedded 
RAM (DRAM) is divided, functioning in a synchronous page mode, each bank 

20 being split into 512 pages of 32 words of 17 bits, by organizing bytes into 
words of 16 bits, sequentially readable therefrom without waiting for any 
precharge cycle; 

c) decoding simultaneously two input blocks or rows in said Reed-Solomon 
decoder (Reed-Solomon Decoder), correcting errors and storing decoding 

25 flags in said embedded RAM (DRAM); 
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d) refreshing said embedded RAM (DRAM) while the decoding of two blocks is 
in progress; 

e) decoding two columns of 208 bytes each of said stored input blocks by reading 
said columns word by word from said embedded RAM (DRAM) and 

5 performing parallel Reed-Solomon decoding on two columns at the time in 
said Reed-Solomon decoder (Reed-Solomon Decoder), correcting errors and 
storing decoding flags in said embedded RAM (DRAM). 

2. A method of organizing and transferring data among the functional blocks 
of an integrated system (ECC-IC) of a read channel of data recorded on a CD 

10 Rom for performing deinterleaving, decoding and correction, said integrated 
system including an input buffer (INPUT BUFFER), an interface with a 
microcontroller bus (|nP IF), a Reed-Solomon decoder (REED-SOLOMON 
DECODER), an embedded RAM (DRAM) accessed through a 17-bit bus, a 
descrambler (Descrambling CD), a data output interface (OUTPUT INTERFACE) 

15 and a timing control block (Timing Control), characterized in that the 
processing of the input data acquired through said input buffer (INPUT BUFFER) is 
performed at a rate of up to 32 times the reference bit rate of CD formatted data 
using a clock for accessing said embedded RAM having a frequency half that of 
the clock that is used in said Reed-Solomon decoder, while reducing the number 

20 of accesses to said embedded RAM needed to perform the processing, the method 
comprising the steps of: 

a) storing CD mode frames of input data including erasure flags in said input 
buffer (INPUT BUFFER) and conveying subcode and sync data to said output 
interface (OUTPUT INTERFACE) 

25 b) performing a first step of deinterleaving (Ml -deinterleaving) in said input 
buffer (INPUT BUFFER) reading frames of input data therefrom, inputting the 
frames to said Reed-Solomon decoder (Reed-Solomon Decoder) that 
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calculates the CI syndrome and storing the data frames in a correction fifo 
buffer while correcting errors on the fly of a previously stored frame and 
storing it in two banks of said embedded RAM; 

c) performing a second step of deinterleaving (M2-deinterleaving) in said two 
5 banks of three distinct banks in which said etiibedded RAM is divided, 

functioning in a synchronous page mode, each bank being split into 512 pages 
of 32 words of 17 bits, by organizing bytes and CI -decoding flags into words 
of 17 bits, by storing with a new addressing scheme data in 28 pages subject to 
refreshing, each page containing two of said frames, by reading frames of 
10 deinterleaved data and flags from said two banks, inputting the frames to said 

Reed-Solomon decoder that calculates the C2 syndrome and storing the data 
frames in a correction fifo buffer while correcting errors on the fly of a 
previously stored frame and storing it in a third bank of said embedded RAM; 

d) performing a third and last step of deinterleaving (M3-deinterleaving) in the 
15 third one of said three "distinct banks in which said embedded RAM is divided, 

using two pages for the process, including the reading four frames of 
deinterleaved data from said two banks, each byte with its own flag of 
correct/incorrect decoding, and packing two bytes in one word to be written in 
the same third bank of said embedded RAM; 

20 e) refreshing 28 pages of each of said three banks of embedded RAM 
simultaneously while the processing of one frame is in progress; 

f) reading frames from said third bank of embedded RAM and inputting them to 
said descrambler (DESCRAMBLING CD) toward said output interface. 
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